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Abstract 

This paper introduces combinatorial representations, which generahse the notion 
of hnear representations of matroids. We show that any family of subsets of the same 
cardinality has a combinatorial representation via matrices. We then prove that any 
graph is representable over all alphabets of size larger than some number depending 
on the graph. We also provide a characterisation of families representable over a given 
alphabet. Then, we associate a rank function and a rank operator to any representation 
which help us determine some criteria for the functions used in a representation. While 
linearly representable matroids can be viewed as having representations via matrices 
with only one row, we conclude this paper by an investigation of representations via 
matrices with only two rows. 

Keywords: matroid; mutually orthogonal Latin square; rank function; entropy 

1 Definition and examples 

Combinatorial representations, defined below, generalise the notion of (linear) representa- 
tions of matroids. 

Definition 1 Let E be a set of n elements and B a family of r- element subsets of E. 

A combinatorial representation of {E, B) over a set X is defined to be an n-tuple of 
functions fi : X'^ — )■ X such that, for any r distinct indices ii, . . . ,ir E E , the map from X'' 
to X'' given by 

(Xl, X2^ . . . , Xr) ^ {fi^ {Xi, . . . , Xr), ■ ■ ■ , firi^i, . . . , Xr)) 

is bijective if and only if {ii, . . . , v} G B. 
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Usually we assume E = {1,2, . . . ,n}. We denote the map given by the displayed equation 
by fb, where b = {ii, . . . , v}- This slight abuse of notation will not be detrimental to the 
rest of the paper. Remark that the cases where r = 1 or r = n are trivial. 

Example 1 Let n = 4 and B = {{1, 2}, {3, 4}}. A combinatorial representation over a 
3-element set {a,b,c} is given by taking fi and /2 to be the two coordinate functions (that 
is, fi{x,y) = X and f2{x,y) = y), and /a and by the tables 
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Note that {E, B) is not a matroid. 

Remark 1. Suppose that b = {ii, . . . ,ir} & B. Define functions gi, for i E E, hj 

gi{xi, ...,Xr) = fi{yi, ■ ■ .,yr), 

where [yi, . . . , yr) is the inverse image of (xi, . . . , Xr) under the bijection /{,. These functions 
also define a combinatorial representation, with the property that gi. is the jth coordinate 
function. So, where necessary, we may suppose that the first r elements of E form a basis 
and the first r functions are the coordinate functions. This transformation can be viewed as 
a change of variables. 

Remark 2. The values of the functions fi are not significant; the definition could be written 
in terms of the partitions of given by these functions: VTj = {{x G A"" : fi{x) = a} : a E A}. 
Thus, we require that the meet (in the partition lattice) of r partitions is the partition into 
singletons if and only if the indices of these partitions form a set in B. 

Remark 3. The condition that the domain of the functions is A^ is also not essential; any 
set of cardinality will do (where q = \A\), since as in Remark 1 the functions corresponding 
to a set in B give this set the structure of a Cartesian power. 

Remark 4. Our point of view is similar to that of experimental design in statistics, where 
functions on (or partitions of) the set of experimental units are called factors, see [U Chapter 
10]. 

To take a very simple example, let us assume that trees in an orchard are laid out in 
a q X q square. Last year, q fertilizers were applied to the trees, using a Latin square layout, 
so that each fertilizer was used once in each row and column. This year, we want to test q 
pesticides on the trees, again in a Latin square layout; but, because of possible interaction 
between fertilizer and pesticide, we would like each combination to occur just once. We can 
regard rows, columns, fertilizers and pesticides as four functions from the set of trees to a 
set of size q (or four factors, each with q parts of size g); our requirement is that we have a 
representation of the complete graph of size 4. 

We return to this in Example 2. 
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Problem Suppose that a set family {E, B) has a combinatorial representation. Is there a 
simple condition which guarantees that {E,B) is a matroid? 

Theorem 1 A set family is a linearly representable matroid if and only if it has a combi- 
natorial representation by linear functions. 

Proof A linear representation of a matroid in an r-dimensional vector space over a field 
F associates a column vector (the ith column of the r xn matrix over F) with each clement 
of E so that a set of columns is linearly independent if and only if the corresponding set 
of elements of E is independent in the matroid. If we associate with each column vector 
(fli, . . . ,ar)^ the function from F^ to F given by {xi, . . . ,Xr) ^ ^ajXj, wc sec that the 
functions defined by the columns of the matrix satisfy the requirements of the definition of 
a combinatorial representation. 

Conversely, a set family represented by linear functions over a field F must be a matroid. 
To show this, we must verify the exchange axiom. Let Bi,B2 G B; as in the preceding 
remark, we may assume that the elements of Bi are the coordinate functions. Now con- 
sider the r — 1 functions fi for i G B2, i ^ k, for some fixed k G -82- These define a 
surjective function from F^ to F^~^. Take any non-zero vector in the kernel, and suppose 
that its Ith coordinate is non-zero. Then it is readily checked that B2 \ {k} U {/} is a basis. □ 

Example 2 A combinatorial representation of the uniform matroid U2,n on a set of size r 
is equivalent to a set of n — 2 mutually orthogonal Latin squares of order r. 

For suppose that f\ and f2 are the two coordinate functions. Then the maps {i,j) ^ 
ihfkihj)) o^nd {i,j) I—)- {fk{i,j),j) are bijective; so fk is a Latin square for all k. Also, 
the map t-> {fkihj)-fiihj)) is bijective for I ^ k; so the Latin squares fk and fi are 
orthogonal. The argument reverses. 

More generally, a representation of Ur,n over an alphabet A is equivalent to an (n, r,n — 
r + 1) MDS code over A. 

2 All set families are representable 

In this section, we show that any family is representable over some finite alphabet by giving 
an explicit construction via matrix linear functions. The proof is based on first representing 
the uniform matroid which contains all r-element subsets, and then removing subsets via 
cartesian products of representations. 

Proposition 1 Let {E,Bi) and {E,B2) be families of r- sets, which have representations 
over alphabets of cardinalities qi and q2 respectively. Then {E, Bi fl B2) has a representation 
over an alphabet of size qiq2- 
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Proof Suppose that (/e) and (ge) are representations of {E, Bi) and {E, B2) over al- 
phabets Ai and A2 respectively. Consider the functions he '■ {Ai x ^2)^' Ai x A2 given 
by 

he{{ai, fei), . . . , {ar, hr)) = (/e(ai, . . . , ar),ge{bi, br)). 

It is tedious but routine to show that, for any b O E, hh is a bijection if and only if both fb 
and Qh are bijections. So the functions {he : e G -E) represent {E, Bi fl B2)- □ 

We can now prove the theorem. 

Theorem 2 Any family is representable over some finite alphabet by matrix linear functions. 

Proof First of all, if = f/r,n, then it has a representation by linear functions. Otherwise, 
we can express B as 

S= n Ur,n\{c}- 

c€Ur.n\B 

We now give a linear representation of t/r,n\{c}. Without loss, let us assume c = 
{1, . . . ,r}. For any prime power p > ("ij) + 1, there are n — 1 vectors W2, . . . ,fn such 
that , . . . , are linearly independent for any choice of indices; moreover, there is Vi G 
(f2, . . . ,Vr) such that vi ^ (fj^, . . . , for any other choice of indices, since 

{v2,...,Vr)\ [j {Vi^,...,Vi^_^) 
ii,...,V_i 

These vectors thus form a linear representation. 

By applying the cartesian product construction in Proposition [H we obtain a matrix 
linear representation of {E,B). □ 

In order to illustrate our concepts and results, we consider the family of bases {E, Br,k) 
for any integers r. A;, where E = {0, . . . ,rk — 1} and 

Br,k = {{0, . . . , r - 1}, {r, . . . , 2r - 1}, . . . , {(A; - l)r, ...,kr- 1}}. 

The case k = r = 2 has been already studied in Example [H so we assume r > 3 or /c > 3 
henceforth. We first give a combinatorial representation of Br^k over Z^. Remark that the 
functions used in that representation are not matrix linear. 

Proposition 2 Let D = {x G : Xq = . . . = Xr-i} be the diagonal and let Xd be its 

characteristic function. Then the functions 

frm+s{x) =Xs+ mXD{x) 

for 0<s<r — 1, 0<m<A; — 1 form a combinatorial representation of Br^k over Z^. 
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Proof First, we prove that for each bm = {mr, . . . , (m+l)r — 1} G B, fb^ is a permutation 
of Z^. Remark that /b„(a;) G -D if and only if x G Z), then it is easily shown that fb^ is 
a bijection on D and also on 'Zl\D. Second, we prove that any = {fm^r+si, ■ ■ ■ , fmrr+sr) 
where b ^ B does not form a permutation of Z^. If Si = sj for some i j (without loss. 
Si = Sj = 0), then if /c > 3 

iUr, /m,r-)(0, 1, 0, . . . , 0) = = /^^,)(0, 2, 0, . . . , 0), 

and if /c = 2, r > 3, 

(/m,., /m,r.)(0, 1, 0, . . . , 0) = = /^^,)(0, 0, 1, . . . , 0). 

Therefore, if Sj = sj, fb is not a permutation. We now turn to the case where all Sj's are 
distinct (without loss Si = i), then there exist i j with rrii ^ rrij and 

/b(0, . . . , 0) = (mi, . . . , m^) = /b(mi, . . . , m^). 

□ 

3 Representations of graphs 

Represent ability is not a monotonic property of alphabet size. For example, a representation 
of the complete graph on 4 vertices is equivalent to a pair of orthogonal Latin squares; these 
exist over alphabets of sizes 3, 4 and 5 but not 6. However, we will prove the following. 

Theorem 3 Let {E,B) be a graph (a family of sets of cardinality 2). Then {E,B) has 
combinatorial representations over all sufficiently large finite alphabets. 

The theorem follows from the two propositions below. 

Consider set systems with r = 2, that is, graphs. We say that a representation (/g : e G E) 
of the graph {E, B) is idempotent if /e(x, x) = x for all x E A, where A is the alphabet. 

Proposition 3 Let {E,Bi) and {E,B2) be graphs, which have idempotent representations 
over alphabets of cardinalities qi and q2 respectively. Then {E, Bi fl B2) has an idempotent 
representation over an alphabet of size qiq2- 

The proof is the same as the one for Proposition [1] and hence omitted. 

For the second proposition, we need to recall the terminology of Richard Wilson [2]. A 
pairwise balanced design consists of a set X and a family C of subsets of X with the property 
that any two distinct elements of X are contained in a unique member of C. It is a PBD(i^), 
where is a set of positive integers, if the cardinality of every member of £ is contained in 
K. The elements of X and C are called points and lines respectively. 
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A set K of positive integers is PBD-closed if, whenever there exists a PBD(ir) with v 
points, then v & K. Given any set K of positive integers, define 



Wilson's main theorem asserts that a PBD-closed set K contains all but finitely many integers 
V such that a{K) 1^ — 1 and I3{K) \ v{v — 1). 

Proposition 4 Let {E, B) be a graph. Then the set of cardinalities of alphabets over which 
{E, B) has an idempotent combinatorial representation is PBD-closed. 

Proof Let K be the set of alphabet sizes for which the graph G = {E,B) has an 
idempotent representation. To show that K is PBD-closed, let {X,C) be a PBD{K) on v 
points; we have to show that v E K. 

By assumption, for each line L of the PBD, we have an idempotent representation (f^) 
with the alphabet L. We construct a representation (/e) with alphabet X by the following 
rule: 

• fe{x,x) = x; 

• a X ^ y, and L is the unique line containing x and y, then fe{x, y) — fe{x, y). 

We claim that this is a representation. 

Take Ci ^ 62- Suppose first that (/ei,/e2) is not a bijection. Then there exist distinct 
pairs {x,y) and {x',y') such that feXx,y) — fei{x',y') for i = 1,2. We consider three cases. 

• li x = y and x' = y', then x = /e,(a;, y) = fe^x', y') = x' . 

• Suppose that x = y and x' 7^ y' . Then feXx\ y') = feXx, u) = x, so x lies in the line L 
containing x' and y'. Then (/^, /^) is not a bijection, so {ei, 62} ^ B. 

• Suppose that x ^ y and x' 7^ y' . If fei{x, y) ^ fe2{x, y), then both of these points he in 
the line L containing x and y; hence x/ and y' also lie in this line. Now {f^^,f^J fails 
to be a bijection, and so {ci, 62} ^ B. So we can suppose that fei{x, y) = fe2{x, y) = z, 
say, with x,y,z G L. Then 



and again (/^, /^) fails to be a bijection. 

Conversely, suppose that {61,62} ^ B. Then, for any line L, is not a bijection; 

so if en f €2) is not a bijection. 

Our claim is proved, and with it, the Proposition. □ 



gcd{k -l:keK}, 
gcd{k{k -1) -.keK}. 



{fei{x,y),fe2{x,y)) = {fe,{z,z),fe2{z,z)), 
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Proof of the Theorem First, we observe that the complete graph Kn has an idempotent 
representation over any field with at least n elements: simply associate a field element A(e) 
with each e E E, and put fe{x,y) = \{e)x + (1 — X{e))y. 

Now we obtain an idempotent representation of the complete graph minus an edge: if 
ei and 62 are the two nonadjacent vertices, take the above representation of the graph on 
E\{ei}, and let /^^ = /ei- 

Next, an arbitrary graph is the intersection of graphs each of which is a complete graph 
minus an edge, and so has an idempotent representation, by Proposition [31 If the alphabet 
size of this representation is N, we obtain further representations over alphabets of size qN, 
for any q whose prime factors are each at least the number of vertices (by Proposition [3] 
again, intersecting with a complete graph). 

Now to prove the Theorem, we know from Proposition S] that the set K of alphabet sizes 
over which idempotent representations exist is PBD-closed; so by Wilson's theorem we have 
only to show that a{K) = 1 and f3{K) = 2. 

Suppose that a{K) > 1. Then every number of the form qN as above is congruent 
to 1 mod a{K), contradicting the fact that we can choose an arbitrarily large prime p = 
— 1 (mod a{K)). So a{K) = 1. The argument for P{K) is similar. □ 

4 Families representable over a given finite alphabet 
4.1 Characterisation 

We now characterize families which are representable over a given alphabet. Clearly, if {E, B) 
is representable over a finite alphabet A, then it is representable over any other alphabet 
with the same cardinality, so we assume A = Tjq unless otherwise specified. Furthermore, if 
(i?, B) is representable over A, then any induced subgraph of (i?, B) is as well. 

First of all, the definitions below easily generalize concepts for matroids. For any e E E, 
denote the set of bases containing e (taking e away) as B{e) = {b C E\{e} : \b\ = r — l,b U 
{e} G B}. 

Definition 2 • I is a loop if no basis contains I, that is, B{1) = 0. 

• li and I2 are parallel if each can be replaced by the other in a basis, that is, B{li) = B{l2) ■ 

• The subset I of E is dependent if no basis contains I. 

These definitions are absolute, that is, independent of representation. However, given a 
representation f = {fi : i E E) , we can say: 

• Hs an f-loop if the partition corresponding to / does not have all its parts of the same 
size. 

• li and I2 are f -parallel if the corresponding partitions are equal. (A statistician would 
say that these factors are aliased.) 
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• A subset / of E, with |/| = s, is f -dependent if the meet of the partitions indexed by 
I in the partition lattice does not have parts of size q"^'^. 

The representation-specific notions just defined imply the absolute notions given earlier. In 
the case of a linear representation of a matroid, the concepts are equivalent. 
We say that a set / is f -independent if it is not /-dependent. 

Remark: if {hyh} is /-independent, then the partitions corresponding to li and I2 are 
orthogonal in the statistical sense. The converse is false, but in fact h and I2 are independent 
if and only if the partitions are orthogonal and their join is the partition with a single part. 
See [H Chapter 10] for definitions. 

It is clear that the parallel relation is an equivalence relation. An obvious choice can 
be made to represent a family with loops and parallel elements: use constant functions for 
loops, and use the same function for any set of parallel elements. We clarify this idea below. 

Definition 3 A function f : ^ A is balanced if for all a & A, \f^^{a)\ = Two 
balanced functions f,g:A^^A are parallel if there exists a permutation t[ E Sa such that 
f = 'Kog. 

It is easily proved that / is balanced if and only if there exists a permutation a G Sa-i- 
such that f = g o a, where g is any coordinate function. 

Again, the parallel relation for functions is an equivalence relation, where each equivalence 
class contains q\ functions. Indeed, any balanced function can be viewed as a partition of 
A"^ into q"^^^ parts of q elements each (the set of pre-images f~^{a) for all a). Two functions 
are parallel if and only if they induce the same partition, and the equivalence class can be 
viewed as that partition, which we shall denote as /. 

Proposition 5 Let {E, B) he a family of bases of rank r and fi be a representation of [E, B) 
over A. Then 

1. If I E E is not a loop, then fi is balanced. Otherwise, fi can be chosen to be any 
imbalanced function. 

2. If I and m are not parallel, then fi and fm are not parallel either. Otherwise, fi and 
fm can be chosen to be parallel. 

Proof [1]) If / is not a loop, then there exists h E B which contains /, say h = {li = 
l,l2, ■ ■ ■ Jr}- Therefore, \fi^^{x)\ = 1 for all x E A^ which implies |/;~^(a;i)| = q^~^ for all 
Xi E A and hence /; is balanced. 

[2]) If fi and fm are parallel and /; can be extended to a bijection by /2, . . . , /r, then we 
easily obtain that /^ can also be extended by the same functions. Therefore, / and m are 
parallel. □ 

We refer to a family of bases as simple if it contains neither loops nor parallel elements. 
Any family of bases {E,B) can be turned into a simple one {E*,B*) by removing loops 
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and considering one element per parallel class. Proposition [5] then indicates that {E, B) 
is representable over an alphabet A if and only if {E*,B*) is representable over the same 
alphabet. 

By Proposition |5] above, all functions in the representation of a simple family are balanced 
and non-parallel to one another. As a corollary, they are all distinct, which shows that there 
are only finitely many representable simple families of a given rank and over a given alphabet. 
We can now characterise these families. 



Definition 4 Let P{q,r) be the set of partitions of into q equal parts and denote its ele- 
ments as /i, . . . , fk- LetM{q,r) = {P{q,r),B), where B = {{fi^, . . . ,7^} : f has q"" parts} . 

Clearly, Ai{q,r) is simple and representable over A: to any partition /, associate the 
corresponding function /j. Therefore, all its induced subgraphs are also representable over 
A (but may not be simple). Moreover, all representable families 'belong to' A4{q,r). 

Theorem 4 The family [E, B) of rank r is representable over A if and only if [E*, B*) is 
isomorphic to an induced subgraph of Ai{q,r) . 

Proof First, as mentioned above, {E,B) is representable over A if and only if {E*,B*) 
is representable over A. The latter is equivalent to the existence of functions (/e) such that 
{ii, ir} & B* if and only if is a bijection, which holds if and only if {/jj, . . . , fi^} 

is a basis of Ai{q,r). □ 



4.2 Properties of A4{q,r) 

Proposition E] below enumerates some properties of A4{q,r). 

Proposition 6 The hypergraph M{q,r) satisfies the following properties. 
1. The number of vertices is given by 



q\ yq'--'^, . . . , q"--^ J 

2. A4{q,r) is regular in the following sense. For any set of 1 < k < r partitions 
{fi,...,fk} which are in a basis of J^{q,r) , there are N{q,r; k,l) sets of I partitions 
{fk+i, ■ ■ ■ , fk+i} such that /i, . . . , fk+i belong to a basis, where 



r-k \ l'' 

N{q,r;k,l) ^ < 



^qv-kiy'' 
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3. In particular, Ai{q,r) is regular of valency 



N{q,r;l,r-1] 



Therefore, the number of bases in Ai{q,r) is given by 



B\ 



r\{q\] 



4- Ai{q,r) contains Ai{q,r — 1) in the following sense. Let gQ{x,Xr) = Xr for all x = 
{xi, . . . , Xr-^i); for any partition f G P(g, r — 1), let g E P{q, r) be defined as g{x, Xr) = 
f{x). Then {/i, . . . , e M{q,r-1) if and only if{go,gi,...,gr-i} G M{q,r). 

Proof 1. The number of balanced functions of A^' to A is exactly the multinomial 
coefficient {^r-i ^r-i) • Since any balanced function has exactly q\ parallel functions, we 
obtain the value of \P{q, r)\. 

2. Let us denote the function generated by fi,...,fk as / and the one generated by 
fk+i, • • • , fk+i as g. The set {/i, . . . , fk+i} belongs to a basis if and only if the function 
h : A"" ^ y^fc+i ig balanced. In other words, g must be balanced over all pre- images f~^{a) 
and hence can be viewed as q'' functions ga : A''^'^ — )• AK There are exactly {gr-kJ qr~k^i) 

r—k ^ 

choices for each ga, and hence [gr-k-l qr-k-i) choices for g. Accounting for all parallel 
functions and all permutations of {fk+i, • • • , fk+i}, we must divide by Z!(g!)' to obtain the 
value of N{q, r; k, I). 

3. N{q, r; 1, r — 1) is a special case of the property above, while \B\ is easily obtained by 
double counting. 

4. This is clear by definition of A4{q,r). □ 

This gives criteria on represent ability of families. 

Corollary 1 Let {E,B) be a simple family of rank r. If \E\ > \P{q,r)\ or if there exists 
a set X of k elements such that there are more than N{q, r; k, I) sets Y of I elements with 
X UY are in a basis of B, then {E, B) is not representable over any alphabet of size up to q. 

We remark that the clique number of A4{q,r) is of particular interest, as a clique of 
size n corresponds to an (n, r) MDS code over an alphabet of size q. Therefore, the MDS 
conjecture [3l Research Problem 11.4] can be recast a conjecture of the clique number of 
M{q, r). 

Proposition [7] below gives necessary and sufficient conditions for adjacency in the graph 
A^(g, r). For any two functions f,g : A'' — t- A, we define the Hamming distance between / 
and g as 

dH{f,g):=\{xeA^:f{x)^g{x)}\. 
Proposition 7 Let f,g& P{q,r), then the following are equivalent. 
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1. They are adjacent in Ai{q,r), i.e. {f,g} C b for some b E Ai{q,r) . 

2. The function g restricted to the set f^^{a) is balanced for all a E A. 

3. For all f parallel to f, duif.g) = {q - l)?''"^- 

4. There exists T ^ Sq such that \T\ = (g — 1)^, there exist a, b with an ^ b for all tt G T, 
and all permutation matrices {M^r ^ M.'^^'^ : vr G T} are linearly independent for which 
d{TT o f,g) = {q — ^)q^~^ for all vr G T. 

Proof The first two properties are clearly equivalent (see the proof of Proposition [6]). Let 
us prove that the second one implies the third one. Suppose g is balanced over all pre-images 
of / (and hence, over all pre-images of /' for any parallel /' of /). Then g agrees with /' 
in g*""^ positions on each pre-image; there are q pre-images, yielding dH{f',g) = q^ — q^~^- 
Also, the third property clearly implies the fourth one. 

Let us now show that the fourth property implies the second one. Foremost, recall that 
the subspace P of M'^^'^ spanned by all g x g permutation matrices has dimension (g — 1)^ + 1 
[1]. For all < z < g — 1, consider the matrix Ri G M'?^'^ whose only nonzero row i is equal 
to the all-zero vector; consider also Cj = Rj for all j. Together, these 2g matrices span a 
linear subspace CR of dimension 2g — 1 (the all-ones matrix 1'^^'? is in the intersection). Let 
us prove that dim(Ci? fl P) = 1 (again, the line spanned by l''^'^). Suppose N G CR fl P, 
then 

TT i j 

The sum of all entries on each row and each column is equal to s = XItt^tt- ^^^^ 
yields s = f3j + nai, from which we have = a is a constant; similarly, we have (3j = (5 
and N = {a + /3)li'"'. 

Since the (a, b) entry of is zero for all vr G T, the subspace Q spanned by the permu- 
tation matrices in T does not contain the all-ones matrix and hence Q © CR = M'^^'?. We 
now expand those matrices to g^-dimensional row vectors by concatenating their rows. We 
can represent g via a column vector 7 G M"^ where ji+qj = |/~"^(^) f^g~^{j)\- The fact that g 
is balanced is equivalent to Cj'j = g*""^ for all j; similarly, / balanced yields Ri'j = g*""^ for 
all < i < g — 2. Also, d{TT o f,g) = q^ — q^~^ yields M^^j = q^~^ for all tc E T. Overall, we 
obtain M7 = q''^^!" where M is non-singular. Thus there is a unique solution, 

given by 7 = g^'~^l'' i.e. when g restricted to the set f~^{a) is balanced for all a E A. O 

We remark that Condition [3] could be replaced by: dnif, g) is a constant for all /' parallel 
to /. This holds since double counting yields 

$^Mvro/,(7)=g^(g-l)(g-l)! 

for any function g : A^ ^ A (not necessarily balanced). However, this simplification cannot 
be applied to Condition HI 
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5 Rank and closure 
5.1 Rank 

This section generalises some concepts from matroid theory to the idea of combinatorial 
representations. Let us first define a rank function. 

Definition 5 Let {E, B) be a family of bases of rank r. A function rk : 2^ — )• [0, r] is a rank 
function for {E, B) if it satisfies the following conditions: 

• IfXCE, then < rk(X) < \X\. 

• IfXCYCE, then rk(X) < rk(F). 

• rk is submodular, i.e. if X and Y are subsets of E, then 

rk(Xur)+rk(Xnr) <rk(X)+rk(r). (1) 

• // |X| = r, then rk(X) = r if and only if X & B. 

The first three conditions are equivalent to: rk(X) is a polymatroid [5J. This definition 
implies that ii X CbeB, then \X\ = rk(X). 

It is easy to show that the only case where modular equality is reached for all subsets is 
for the uniform matroid Ur^r- Indeed, we have for any e E E 

Tk{E\{e}) = Tk{E) + rk(0) - rk({e}) = r - 1, 

and hence any basis must contain e. Thus only E itself can be a basis, and [E, B) = Ur^r- 

Below we show that every representation leads to a rank function. In particular, if / 
is a representation by matrix-linear functions, then its corresponding rank function takes 
rational values. 

Proposition 8 Let [E, B) be a family of bases and let f = (/i, . . . , /„) be a representation 
for it over a finite alphabet of size q. Then rj defined as 

r,{X):=HUx) = - ^ log, {^^| = r-g"^ E \fx\a)\\og,\f~^\a)\ 

where H is the q-ary entropy function, is a rank function for [E, B). 

Proof The proof of submodularity simply follows Shannon's inequality ^ Eq. (2.93)] 
and was already given in [7]. The other properties are straightforward. □ 

Remark 1. A subset J of is /-independent if and only if r/(/) = |/|. 
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Remark 2. The converse of Proposition [8] is not true: there exist rank functions which 
do not correspond to any combinatorial representation. This fact was proved in [7J, where 
they showed that any entropy function satisfies an additional inequality. In particular, they 
demonstrate that the rank function used in the proof of Theorem |5] below with p = 2 over 
the family {E = {1,2,3,4},,B = {{1,2}}) cannot be viewed as the entropy function of any 
representation for {E,B). 
Let us denote 

mr(X) = max 16 fl X|, 
bee 

when {E, B) is a matroid, this is its rank function. The rank function for the uniform matroid 
Ur,n is given by 

Mr(X) =min{r, |X|}. 
It is easily shown that for any family {E, B) and any rank function, we have 

mr(X) < rk(X) < Mr(X). 

Therefore, {E, B) is a matroid if and only if it has an integer-valued rank function. Theorem 
|5] below shows that for any divisor p > 2, there exists a rank function which takes values 
over the integers divided by p. Therefore, although matroids are viewed as special due to 
the submodularity of the rank function, it seems that the particularity of matroids actually 
resides in the integrality of the rank function. 

Theorem 5 For any p > 2, {E, B) has a rank function which takes values over the integers 
divided by p. 

Proof We claim that the function rk(X) defined by 



rk(X) = 




if |X| < r - lorX G B 
if\X\ = r,X ^B 
if IXI >r + l 



is a rank function for {E,B). Only the submodular inequality is in question, so we have 
to check that ([1]) holds for any X,Y C E (with rk(X) > rk(y) without loss of generality). 
This clearly holds (with equality) ii Y C X hence we assume this does not occur. We need 
to study four cases. 

• Case I: |X|, |F| < r - 1. Then 

rk(X) + rk(F) = |X| + |F| = |X U r| + |X n r| > rk(X U F) + rk(X n Y). 

• Case II: rk(X) = r - ^, \Y\ < r - 1. Then rk(X n F) = |X n F | < |r | - 1 and hence 

rk(X) + rk(r) > r + (|F| - 1) > rk(X U F) + rk(X n Y). 
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• Case III: rk(X) = rk(F) 



rk(X) + rk(F) 



-. Then |X fl Fl < r — 1 and hence 
= r + (r - -) > rk(X UY)+ rk(X n Y). 



• Case IV: rk(X) 



r. Then 



rk(X) + rk(y) 



r + rk(y) > rk(X UY) + rk(X n Y). 



□ 



Theorem [5] also shows that the supremum Mr(X) of all rank functions can be approached. 
Therefore, one can only expect to derive lower bounds on any rank function, but not any 
upper bound other than Mr(X). 

It seems difficult to design a generic rank function which takes significantly lower values 
than the one in Theorem O However, by an argument similar to that of Theorem \5\ we can 
show that the following is a rank function for [E, B): 



5.2 Bounds on rank functions 

In view of Proposition [HI studying rank functions in general can help determine some con- 
straints on the functions used in a representation. The lower bound of mr(X) for any rank 
function is tight for the rank function of a matroid. However, the exchange axiom implies 
that matroids are typically dense, i.e. the number of bases is large, and (leaving out trivial 
cases) for any basis b, there exists another basis b' with |6 fl 6'| = r — 1. Therefore, in order 
to obtain lower bounds on the rank function which differ from mr(X), we consider sparse 
families. 

Proposition [9] shows that a single "isolated basis" leads to significant gap between mr(X) 
and the rank of X for some X. The restriction on fractions only serves the sakes of conciseness 
and ease of presentation. 

Proposition 9 Let {E, B) he a family of rank r, and let 




/ = min max |6nc|. 

beB ceB,c^b 



Then there exists X <Z E such that \X\ = r, mr(X) = ^ 
for B, we have rk(X) > . Thus 



and for any rank function rk(X) 



rk(X) - mr(X) > 



r — I 



4 



provided all fractions are integers. 
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Proof Let b G B such that the intersection of any basis with b has size at most / and let 
ceB such that \bnc\= I. Define two sets X,Y C E as follows: X = (6 n c) U X' U Z and 
Y = {b D c) U y U Z , where X', Y' (1 b and Z C c have cardinality J = and none of the 
constituents intersect. Therefore, |X| = \Y\ = r and 

rk(X) + rk(r) > rk(X U F) + rk(X HY) = r + I + J, 

since 6 C X U y and X fl F = Z U (6 n c) C c. Without loss, suppose rk(X) > rk(r), 
then rk(X) > '""'"^"'"'^ = Let us now show that mr(X) = I + J = ^-i-^. First, we have 

|6nX| = |cnX| = I + J. Second, for any other d E B, we have the following inequalities 

\dnz\< \z\ < J, 
\dnx'\ + \dn{br\c)\ <\dnb\ < i, 

and hence 

\dnx\ = \dn{bnc)\ + \dnx'\ + \dnz\ < J + 1. 

□ 



The argument is strengthened in Theorem E] below, which exhibits a set with mr(X) = 1 
and yet for which the rank is arbitrarily close to r. Recall that a transversal for {E, B) is a 
set of elements in E such that any b E B contains at least one element of the transversal. 

Theorem 6 Let k denote the minimum size of a transversal for {E,B), then there exists a 
set of elements X C E such that \X\ = k, mr(X) = 1, and 

rk(X) > r (^1 - (l - i 

Proof We prove, by induction on 1 < i < A;, that there exists a set Xj = {ei, ...,€{} 

with mr(Xj) = 1 and rk(Xj) > = r ^1 — (l — ^Y^. This trivially holds for i = 1, let 

us assume this holds for i — 1. Let b = {e], . . . ,e[} be a basis which does not intersect 
with {ci, . . . , ej_i}. Note that such a basis exists, as {ei, . . . , ej_i} is not a transversal by 
definition of k. We have 

r 

5^rk(X,_i U {e^}) > rk(X,_i U 6) + (r - 1) rk(X,_i) > r + (r - l)r,_i, 
i=i 

and hence rk(Xj_i U {ej}) > 1 + (1 — l/r)rj_i = for some j. □ 
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5.3 Closure 

We can generalise the idea of parallelism to the concept of closure, defined below. The point 
is that the closure operator depends on the rank function, not only on the family of bases. 

Definition 6 For a rank function Tk{X) of{E,B), the closure associated to ik{X) of X C E 
is given by 

cU(X) = {eeE: rk(X U {e}) = rk(X)}. 
If the rank function is known, we simply write cl(X). 
Proposition 10 The closure satisfies the following properties. 

1. For allX CE, X C d{X). 

2. IfXC Y, then cl(X) C cl(r). 

3. For all X CE, cl(cl(X)) = cl(X). 

4. For all X CE, rk(cl(X)) = rk(X). 

5. For all X CE, c\{X) = E if and only «/rk(X) = r. 

Note that the first three properties indicate that cl(X) does behave as a closure operator, 
while the last two depend on the rank function. 

Proof Property [1] is trivial. 
[2j For any e G cl(X), we have 

rk(r) + rk(X) = rk(F) + rk(X U {e}) > rk(F U {e}) + rk(X), 

and hence rk(y U {e}) = ik{Y) and e G c\{Y). Therefore, cl(X) C c\{Y). 
m Denote the elements of cl(X)\X as ei, . . . , e^. We have 

k 

A;rk(X) = ^rk(XU {ci}) > rk(cl(X)) + (A; - l)rk(X), 

i=l 

and hence rk(cl(X)) = rk(X). 

Ei Let e G cl(cl(X)), then by Property H 

rk(X) = rk(cl(X)) = rk(cl(X) U {e}) > rk(X U {e}), 

and hence e G cl(X). 

[5l If rk(X) = E., then by Property |4]r = rk(cl(X)) = rk(X). Conversely, if rk(X) = r, 
then for any e e E, r = rk(X) = rk(X U {e}) and e G cl(X). □ 

It is worth noticing that all operators satisfying Properties (H [21 and [3] do not necessarily 
correspond to a rank function. An example is given below; in order to simplify notations, 
we shall remove brackets and commas when describing subsets of elements. 
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Example 3 Let E = {0, . . . , 5} and B^^2 = {012, 345}. We define the closure operator c as 
c{X) = X for allX CE except: 

c{X) = {0} U X ifX e {34, 134, 234, 35, 135, 235} 

c(X) = E if X contains a basis orX e {0145, 0245, 1234, 1235, 1245} 

It can be shown that c satisfies Properties Ul anc? O Furthermore, if \X\ = 3, then 
c{X) = E if and only if X & B. However, suppose r is a rank function generating the 
closure operator c, then 

4 > r(34) + r(35) = r(034) + r(035) > r(03) + r(0345) > 4, 

for 3 ^ c(0) and hence r(03) > r(0) = 1. 

We refer to a set equal to its closure flat. 

Proposition 11 We have the following properties. 

1. For any X (1 E, cl(X) is equal to the intersection of all flats containing X. 

2. The family of flats is closed under intersection. 

3. For any X,Y C E, c\{X UY) = cl(cl(X) U cl(F)) . 

Proof 1. Let F be a fiat containing X and let e G cl(X), then 

rk(F) + rk(X) = rk(F) + rk(X U {e}) > rk(F U {e}) + rk(X), 

and hence e G cl{F) = F. 

2. For any fiats F, G we have F fl G C cl(-F fl G). Moreover, we have 

cl(F n G) C cl(F) = F, 

and similarly cl(F n G) C G and hence cl(F nG) C FnG. Therefore, FnG = c\{F n G) 
is a fiat. 

3. Since XUF C cl(X)Ucl(r), we have cl(XUF) C cl(cl(X)Ucl(r)). On the other hand, 
d{X) and cl(y) are both subsets of d{XUY) and hence cl(cl(X) Ucl(F)) C cl(cl(Xur)) = 
cl(XUF). □ 

We can then define the lattice of flats, which is not necessarily semimodular (as the height 
function is not equal to the rank) but where the rank function is a semivaluation. Remark 
that although 

cl(XUF) = cl(cl(X) Ucl(y)), 
we do not necessarily have equality in the following: 

cl(X n F) C cl(X) n cl(F) = cl(cl(X) n cl(F)). 
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We finish this section by noticing that the closure associated to the rank function of a 
given combinatorial representation satisfies 



6 Representation by matrices with 2 rows 

In this section, we are interested in families which are "nearly" linearly representable ma- 
troids: those for which there is a representation with matrices with only two rows. By 
Proposition [H if {E, B) is the intersection of d linear matroids, then it has a representation 
by matrices of size d. Proposition [12] gives a counterexample of the converse for d = 2. 

Proposition 12 The family 82^3 is not the intersection of two matroids, yet it has a repre- 
sentation by matrices with 2 rows. 

Proof We first show that B := 82,3 is not the intersection of two matroids. Recall that 
E = {0, . . . , 5} and i3 = {{0, 1}, {2, 3}, {4, 5}}. Suppose on the contrary that B = Bi n B2, 
where Bi is a matroid on E for i E {1, 2}. Since r = 2, we view these as graphs, and we say 
that two vertices are adjacent if they form a basis. By the exchange axiom, for each vertex 
e and each basis b E B not containing e, e is adjacent in Bi and B2 to one or two vertices 
of b. Since such an edge does not appear in Bi H B2, we conclude that e is adjacent in Bi 
to exactly one vertex of b and is adjacent in B2 to the other vertex of b. Without loss of 
generality, let be adjacent to 2 and 4 in Bi. {0,2}, {0,4} G Bi, then {1,2}, {1,4} ^ Bi 
by applying the conclusion above to b = {0, 1} and e = 2 and e = 4, respectively. However 
{0,2}, {0,4} ^ B2 show that 0,2,4 are parallel in B2 and hence {2,4} ^ B2. Therefore, 
{2,4} G Bi while {1,2}, {1,4} ^ Bi, and the exchange axiom is violated: this is the desired 
contradiction. 

We now give a representation of {E, B) using matrices with 2 rows. Let p be any prime 
power, and consider A = GF(p)^, then we can express (x, y) G as (a;i, X2, Z/i, ^2) £ GF(p)'^. 
The functions representing {E,B) can be expressed as fi{xi,X2,yi,y2) = Fi{xi,X2,yi,y2)'^ , 



clAX) 



{eeE 
{eeE 
{eeE 
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Figure 1: A graph without representation by matrices with 2 rows, 
where Fj G GF(p)^^^ are given by 

Fo 
Fi 
F2 
F3 
F4 
F5 

□ 

We now prove that there exist famihes of bases which do not have a representation using 
matrices with 2 rows. The proof is based on the Ingleton inequahty [8] : for any four subsets 
Xi, . . . , X4 of a family representable using matrices, we have 

rk(Xi) + rk(X2) + rk(Xi U X2 U X3) + rk(Xi U X2 U X4) + rk(X3 U X4) 

< rk(Xi U X2) + rk(Xi U X3) + rk(Xi U X4) + rk(X2 U X3) + rk(X2 U X4). (2) 

Proposition 13 Let = {1, . . . , 7} and B = {16, 27, 34, 45, 53} as represented in Figure{^ 
Then {E, B) does not have a representation using matrices with 2 rows. 

Proof Any rank function rk(X) generated by a representation using matrices with 2 rows 
takes half-integer values. However, let rk(X) be a rank function with half-integer values and 
let us prove that it violates the Ingleton inequality in ([2]). First, we have rk(12), rk(17) < 1.5 
as neither are bases and 

rk(12) + rk(17) > rk(l) + rk(127) = 3, 
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which imphes that rk(12) = rk(17) = 1.5. 
Second, submodularity imphes 

rk(123) + rk(124) > rk(12) + rk(1234) = 3.5, 

and hence rk(123) = 2 or rk(124) = 2; without loss, say rk(123) = 2. By symmetry, we also 
obtain that rk(124) = 2 or rk(125) = 2; say rk(124) = 2. 

Denoting Xi = {i} for i = 1, . . . , 4, the Ingleton inequality is violated: 

rk(l) + rk(2) + rk(123) + rk(124) + rk(34) = 8 > 
rk(12) + rk(13) + rk(14) + rk(23) + rk(24) = 7.5. 

□ 

By the same technique, one can also show that the following family of bases is not 
representable using matrices with 2 rows: 

E = {0,1,...,8},B= {012, 034, 056, 078, 057, 135, 137}. 

The Ingleton inequality is violated by 

Xi = 01, X2 = 03, X3 = 05, X4 = 07. 

However, Proposition [H] shows that the Ingleton inequality cannot rule out representa- 
tions by matrices with 3 rows. 

Proposition 14 The rank function defined by 

{\X\ if |X| < r- lorX G i3 

r-| if\X\ = r,X^B (3) 

r if|X|>r + l 

satisfies the Ingleton inequality. 

Proof The fact that this is a rank function was proved in Theorem [51 We denote the left 
and right hand sides of the Ingleton inequality in ([2]) as L and R, respectively. The proof 
goes by considering cases based on the terms in R. It is important to note the symmetric 
roles of Xi and X2 on one hand and X3 and X4 on the other hand in Ingleton's inequality. 
In order to illustrate our calculations, square brackets indicate where we use the submodular 
inequality. 

Case I: rk(Xi U X2) = rk(Xi U X2 U X3). We then have 

R = rk(Xi U X2 U X3) + [rk(Xi U X3) + rk(Xi U X4)] + [rk(X2 U X3) + rk(X2 U X4)] 

> rk(Xi U X2 U X3) + rk(Xi) + rk(X2) + [rk(Xi U X3 U X4) + rk(X2 U X3 U X4)] 

> L. 

By symmetry, we also rule out the case where rk(Xi U X2) = rk(Xi U X2 U X4). 
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Case II: rk(Xi U X3) = rk(Xi U X2 U X3). We then have 

R = rk(Xi U X2 U X3) + [rk(Xi U X2) + rk(Xi U X4)] + [rk(X2 U X3) + rk(X2 U X4)] 
> L. 

By symmetry, we also rule out the cases where rk(XiUX4) = rk(XiUX2UX4), rk(X2UX3) = 
rk(Xi U X2 U X3), or rk(X2 U X4) = rk(Xi U X2 U X4). Remark that the case where some 
term in R is equal to r is contained in Case I or Case II, therefore all ranks are less than r 
in the right hand side in the next Cases. 

Case III: All the terms in R have cardinality at most r — 1, and hence their rank is equal 
to their cardinality. It is clear that the cardinality function satisfies the Ingleton inequality, 
for |X| = dim(Vx) for all X C E, where Vx is the subspace generated by the unit vectors 
{ei-.ie X}. 

Case IV: Some terms in R have rank r — ^, say 1 < A; < 5 of them, li k < 3, then 
rk(Xi U X2 U X3) = r < |Xi U X2 U X3I - 1 or rk(Xi U X2 U X4) = r < |Xi U X2 U X4I - 1. 
This holds since one of the two terms in L must have a greater rank than one of the k terms 
in R with rank r — ^. Then the term in L properly contains the according one in R, and its 
cardinality is at least r + 1. Let us denote 



R 


:= |Xi 


1 + IX2I + |Xi UX2 UX3 


1 + |Xi U X2 


UX4I 


+ 


1X3 


UX4I 


L 


:= |Xi 


UX2I + |Xi UX3I + |Xi 


UX4I + 1X2 


UX3I 


+ 


\X2 


UX4I 



Thus 

- k - 

R = R - -> R-l> L-l> L. 

lik e {4, 5}, then both Xi U X2 U X3 and Xi U X2 U X4 have rank r, and 

- k - 

R = R-->R-2>L-2>L. 
3 - 

□ 

7 Conclusion 

The representation of graphs in Section [3]) yields a couple of open questions. 

• Given a graph, what is the largest alphabet over which it is not represent able? 

• Does Theorem [3] hold for r-uniform hypergraphs with r > 2? 
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Also, after generalising loops and parallel elements in Section H] and rank functions and 
closure operators in Section El one wonders if more concepts from matroid theory could be 
generalised in the framework of combinatorial representations. 

The relation with information theory via rank functions and especially the submodular 
inequality needs to be further investigated. Indeed, a wealth of non-Shannon inequalities have 
been discovered recently [3 El HOl [11] , see [12] for a survey on this matter. However, it seems 
rather unclear how much more information can be drawn from all these new inequalities and 
how hard they are to manipulate. Similarly, non-Ingleton inequalities have been discovered 
for the dimension of intersections of linear subspaces [13j. Once again, what conclusions can 
we draw from these inequalities? 
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